Ensembl Genomes 2016: more genomes, more complexity

نویسندگان

  • Paul J. Kersey
  • James E. Allen
  • Irina Armean
  • Sanjay Boddu
  • Bruce J. Bolt
  • Denise Carvalho-Silva
  • Mikkel B. Christensen
  • Paul Davis
  • Lee J. Falin
  • Christoph Grabmueller
  • Jay C. Humphrey
  • Arnaud Kerhornou
  • Julia Khobova
  • Naveen K. Aranganathan
  • Nicholas Langridge
  • Ernesto Lowy
  • Mark D. McDowall
  • Uma Maheswari
  • Michael Nuhn
  • Chuang Kee Ong
  • Bert Overduin
  • Michael Paulini
  • Helder Pedro
  • Emily Perry
  • Giulietta Spudich
  • Electra Tapanari
  • Brandon Walts
  • Gareth Williams
  • Marcela K. Tello-Ruiz
  • Joshua C. Stein
  • Sharon Wei
  • Doreen Ware
  • Dan M. Bolser
  • Kevin L. Howe
  • Eugene Kulesha
  • Daniel Lawson
  • Gareth Maslen
  • Daniel M. Staines
چکیده

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ensembl Genomes (non-chordates): Quick tour

Ensembl Bacteria [3], Protists [4], Fungi [5], Plants [6] and Metazoa [7] (collectively, ‘Ensembl Genomes’) are five portals for genome-scale data, developed in close collaboration with scientific communities expert in the biology of individual species. Implemented using the Ensembl software suite for genome analysis and browsing, which was developed for the study of vertebrate genomes (describ...

متن کامل

European Nucleotide Archive: Quick tour

The European Nucleotide Archive [2] (ENA) provides a comprehensive, accessible and publicly available repository for nucleotide sequence data. The ENA attracts users from a multitude of research disciplines and serves as an underlying data infrastructure for other EBI services, including Ensembl [3], Ensembl Genomes [4], UniProt [5] and ArrayExpress [6]. Data submitted to the ENA are validated ...

متن کامل

Ensembl BioMarts: a hub for data retrieval across taxonomic space

For a number of years the BioMart data warehousing system has proven to be a valuable resource for scientists seeking a fast and versatile means of accessing the growing volume of genomic data provided by the Ensembl project. The launch of the Ensembl Genomes project in 2009 complemented the Ensembl project by utilizing the same visualization, interactive and programming tools to provide users ...

متن کامل

The Ensembl automatic gene annotation system.

As more genomes are sequenced, there is an increasing need for automated first-pass annotation which allows timely access to important genomic information. The Ensembl gene-building system enables fast automated annotation of eukaryotic genomes. It annotates genes based on evidence derived from known protein, cDNA, and EST sequences. The gene-building system rests on top of the core Ensembl (My...

متن کامل

The Complete Chloroplast Genomes of Asteraceae Species

Until now, twenty-seven Asteraceae complete chloroplast genomes were uncovered in the Gene bank. The highly conservative nature and slow evolutionary rate of the chloroplast genome demonstrated that it was uniform enough to perform comparative studies across different species but divergent sufficiently to capture evolutionary events, which makes it a suitable and invaluable tool or molecular ph...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Nucleic acids research

دوره 44 D1  شماره 

صفحات  -

تاریخ انتشار 2016